Regular cylindrical algebraic decomposition
We show that a strong well-based cylindrical algebraic decomposition P of a bounded semi-algebraic set S is a regular cell decomposition, in any dimension and independently of the method by which P is constructed. Being well-based is a global condition on P that holds for the output of many widely used algorithms. We also show the same for S of dimension at most 3 and P a strong cylindrical algebraic decomposition that is locally boundary simply connected: this is a purely local extra condition.
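For context, a standard definition the abstract takes for granted: a cell decomposition is *regular* when every characteristic map is a homeomorphism onto its image, so each closed cell is homeomorphic to a closed ball. In the notation of the abstract (the symbol sigma for a cell is our addition), this reads:

```latex
% Background, not from the abstract: the regularity property established for P.
% For every cell \sigma of the decomposition P of S, the characteristic map
\Phi_\sigma \colon \overline{B}^{\,\dim\sigma} \xrightarrow{\ \approx\ } \overline{\sigma} \subseteq S
% is a homeomorphism onto the closed cell \overline{\sigma}.
```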
Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning
The Mixture of Experts (MoE) is a widely known neural architecture in which an ensemble of specialized sub-models optimizes overall performance at a constant computational cost. However, conventional MoEs pose challenges at scale due to the need to store all experts in memory. In this paper, we push MoE to the limit. We propose an extremely parameter-efficient MoE by uniquely combining the MoE architecture with lightweight experts. Our MoE architecture outperforms standard parameter-efficient fine-tuning (PEFT) methods and is on par with full fine-tuning while updating only the lightweight experts -- less than 1% of an 11B-parameter model. Furthermore, our method generalizes to unseen tasks as it does not depend on any prior task knowledge. Our research underscores the versatility of the mixture-of-experts architecture, showcasing its ability to deliver robust performance even under rigorous parameter constraints. The code used in all experiments is publicly available at: https://github.com/for-ai/parameter-efficient-moe
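As a rough illustration of the idea (a minimal sketch, not the paper's actual implementation; the class and parameter names `LightweightMoE`, `num_experts`, and `rank` are hypothetical), the following PyTorch code augments a frozen linear layer with a soft router over LoRA-style lightweight experts:

```python
# Hypothetical sketch: a frozen base linear layer plus a router over
# lightweight low-rank (LoRA-style) experts. Only the router and the
# low-rank factors are trainable, keeping the trainable fraction tiny.
import torch
import torch.nn as nn
import torch.nn.functional as F

class LightweightMoE(nn.Module):
    def __init__(self, base: nn.Linear, num_experts: int = 4, rank: int = 4):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False          # base weights stay frozen
        d_in, d_out = base.in_features, base.out_features
        # Expert e is the low-rank update A[e] @ B[e]; B is zero-initialized
        # so the layer starts out identical to the frozen base layer.
        self.A = nn.Parameter(torch.randn(num_experts, d_in, rank) * 0.01)
        self.B = nn.Parameter(torch.zeros(num_experts, rank, d_out))
        self.router = nn.Linear(d_in, num_experts)  # soft token-level routing

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq, d_in)
        gates = F.softmax(self.router(x), dim=-1)              # (b, s, E)
        # expert_out[..., e, :] = x @ A[e] @ B[e]               (b, s, E, d_out)
        expert_out = torch.einsum("bsd,edr,erk->bsek", x, self.A, self.B)
        mixed = torch.einsum("bse,bsek->bsk", gates, expert_out)
        return self.base(x) + mixed

# Usage: only the router and expert factors receive gradients.
layer = LightweightMoE(nn.Linear(512, 512), num_experts=4, rank=4)
out = layer(torch.randn(2, 10, 512))
```

Because each expert contributes only `rank * (d_in + d_out)` parameters per layer, the trainable budget stays far below the size of the base model, which is the regime the abstract describes.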